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Abstract. The following question is due to Chatterjee and Varadhan (2011). Fix < p < r < 1 
and take G ~ Q{n,p), the Erdos-Renyi random graph with edge density p, conditioned to have at 
least as many triangles as the typical Q{n,r). Is G close in cut-distance to a typical Q{n,r)l Via 
a beautiful new framework for large deviation principles in Q{n,p), Chatterjee and Varadhan gave 
bounds on the rephca symmetric phase, the region of (p, r) where the answer is positive. They 
further showed that for any small enough p there are at least two phase transitions as r varies. 

We settle this question by identifying the replica symmetric phase for triangles and more generally 
for any fixed d-regular graph. By analyzing the variational problem arising from the framework of 
Chatterjee and Varadhan we show that the replica symmetry phase consists of all (p, r) such that 
(r'*, hp{r)) lies on the convex minorant of a; i— >■ hp{x^^'^) where hp is the rate function of a binomial 
with parameter p. In particular, the answer for triangles involves hp{^/x) rather than the natural 
guess of hp{x^^^) where symmetry was previously known. Analogous results are obtained for linear 
hypergraphs as well as the setting where the largest eigenvalue of G ~ Q{n,p) is conditioned to 
exceed the typical value of the largest eigenvalue of Q{n,r). Building on the work of Chatterjee 
and Diaconis (2012) we obtain additional results on a class of exponential random graphs including 
a new range of parameters where symmetry breaking occurs. En route we give a short alternative 
proof of a graph homomorphism inequality due to Kahn (2001) and Galvin and Tetali (2004). 



1. Introduction 

The following question was raised by Chatterjee and Varadhan [8] concerning large deviations 
in Q{n,p), the Erdos-Renyi random graph on n vertices with edge density p. 

Fix < p < r < 1 and let G n be an instance of Q(n,p) conditioned on the rare 
event of having at least as many triangles as a typical instance of Q{n,r). Is it the 
case that as n — )• oo the graph G„ is close in cut-distance to a typical G{n, r) graph? 



(A more formal statement, including the definition of the graph cut-metric, is postponed to ^.1 
This amounts to asking whether the likely reason for too many triangles is an overwhelming number 
of edges, uniformly distributed, or some fewer edges arranged in a special structure, e.g., a clique. 
Dubbed replica symmetry and symmetry breaking, resp., the dichotomy between these scenarios 
turns out to depend on p and r. Intriguingly, it was known that for small enough p there are at 
least two phase transitions as r increases from p to 1, with symmetry replica near the two endpoints. 

In this work we analyze the variational problems arising from the framework of Chatterjee and 
Varadhan and obtain a full answer for the question above, as depicted in Fig. [TJ More generally, we 
identify the phase diagram for upper tails of any fixed regular subgraph and derive related results 
in other random graph settings, e.g., exponential random graphs, random hypergraphs, etc. 

1.1. Subgraph densities and spectral radii. Large deviations for subgraph densities in random 
graphs have been extensively studied (see, e.g., [2Sl 03 EH [271 IMl El ESI El as well as [Sj i2BJ and 
the references therein). A representing example which drew significant attention is upper tails 
of triangle counts, i.e., estimating the probability that Q{n,p) has at least (3)?^^ triangles where 
r = {1 + r])p for fixed rj > (allowing p to vary with n), a problem whose understanding is still 
incomplete. The order of the rate function (the normalized logarithm of this probability) when 
p — 7- was only very recently settled: Chatterjee [5j and DeMarco and Kahn |14j independently 
established it to be n?p'^ log{l/p) when p > logn/n, and yet the exact rate function remains 
unknown in this range of p. We now turn to what was known for fixed p, our focus in this paper. 
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Figure 1. Phase diagram for the upper tail of triangle counts. Shaded region is 
the replica symmetric phase; the region to its left is the symmetry breaking phase. 
Previous results ||6|18j established replica symmetry to the right of the dashed curve. 

Clearly, if the total number of edges in Q{n,p) deviates to m ~ {^r then one will arrive at a 
random graph with m uniformly distributed edges featuring the desired number of triangles. Thus, 
the large deviation rate function for encountering (3)?"^ triangles in Q{n,p) is at most hp{r), where 

X 1 — X 

hp(x) := xlog- + (I - x)log for p G (0, 1) and X G [0, 1] (1.1) 

p I — p 

is the rate function associated to the binomial distribution with probability p. However, it is 
possible that other configurations with broken symmetry would give rise to lower rate functions. 

As an application of Stein's method for concentration inequalities, Chatterjee and Dey [6j found 
a range of {p, r) where the large deviation rate function for triangles is equal to hp{r), namely when 
p > 2/(2 + e^/^) ~ 0.31 or when r is suitably close either to p or to 1. This symmetry region was 
explicitly stated in Theorem 4.3] as all pairs {p, r) where (r^, hp{r)) lies on the convex minorant 
of X I— )> hp{x^^^). The breakthrough work of Chatterjee and Varadhan [8j introduced a remarkable 
general framework for large deviation principles in Q(n,p) via Szemeredi's regularity lemma [H] and 
the theory of graph limits by Lovasz et al. [MllMlll]- It expressed the large deviation rate function, 
and moreover the structure of the random graph conditioned on the large deviation, in terms of a 
variational problem on graphons, the infinite-dimensional limit objects for graph sequences. 

Although often this variational problem is untractable, for triangles in the mentioned range of 
{p, r) it was shown in [8j to have a unique and symmetric solution. To formalize this symmetry, we 
say a graph G is close in cut-distance to a typical Q{n,r) graph if all induced subgraphs on a linear 
number of vertices have edge density close to r. More precisely, for a graph G and r G [0, 1] let 

6a{G,r):= sup }^ J eciA, B) - r \A\ \B\ \ , 
A,BGViG) \V{G)\ 

where eG{A,B) is the number of pairs {a,b) £ Ax B with ab G E{G). Chatterjee and Varadhan 
showed that, in the above range of {p,r), if G„, ~ Q{n,p) is conditioned to have at least {^)r^ 
triangles then 5n{Gn,r) — )■ in probability as n — )■ 00. The function x 1— )■ hp{x^^^) governing that 
region is the natural candidate for the phase boundary as the cube-root accounts for the 3 edges of 
the triangle (see, e.g., [151 §4.5.2] for related literature), and indeed Chatterjee and Varadhan asked 
whether this precisely characterizes the full replica symmetric phase. As it turns out, however, the 
replica symmetric phase is strictly larger, being governed instead by x i-)- hp[^/x). (See Fig. m) 
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Figure 2. The phase boundary for counts of d-regular fixed subgraphs in Q(n,p). 

For any graph G let e{G) = \E{G)\, and for any two graphs G and H let hom(ff, G) denote the 
number of homomorphisms from H to G (i.e., maps V{H) — )• V{G) that carry edges to edges). Let 

|hom(g,G)| 

- |y(G)r(^)l 

be the probability that a random map V{H) — )• V{G) is a graph homomorphism. We now state 
our main result on the phase diagram for large deviations in densities of d-regular subgraphs. 

Theorem 1.1. Fix < p < r < 1 and let H be a fixed d-regular graph for some d > 2. Let 
Gn ~ G{n,p) be the Erdos-Renyi random graph on n vertices with edge probability p. 

(i) If the point {r^, hp{r)) lies on the convex minorant of the function x i— )• hp{x^/'^) then 

lim T^logP (t{H,Gn) > r^^'A = -hp{r) 

and furthermore, for every e > there exists some C = C{H,e,p,r) > such that for all n, 
P r)<e\ t{H, G„) > r^^^^) > 1 - e"^"' . 

(a) If the point (r*^, hp{r)) does not lie on the convex minorant of the function x i— ?• hp{x^^^) then 

lim y^logP (t{H,Gn) > r<''A > -hp{r) 



(2) 

and furthermore, there exist e,C > such that for all n, 



inf {5n(G„,s) : < s < l} > e 



t(H,Gn)>r 



e(H) 



> 1 



In particular, when d = 2, case ^ occurs if and only if p < [l + (r ^ — 1)^^^^ ^''^j ^ . 

The boundary curves for various values of d are plotted in Fig. [2} It is easy to verify (Lemma |A.1[ ) 
that the rightmost point in the curve for d-regular subgraphs is {p, r) 



d-i 
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We give an analogous result for large deviations of the spectral radius of an Erdos-Renyi random 
graph. The phase boundary in this case coincides with that of triangles. 

Theorem 1.2. Fix < p < r < 1. Let Gn ~ Gi^jp) be an Erdos-Renyi random graph on n vertices 
with edge probability p, and let Ai(G„) denote the largest eigenvalue of its adjacency matrix, 
(i) Ifp>[l + (r-i - i)i/{i-2r)]-i ^/jg^ 

lim ^logP(Ai(G„) > nr) = -hp{r) 

and furthermore, for every e > there exists some C = C{e,p,r) > such that for all n, 
P {dn{Gn, r)<e\ Ai(G„) > nr) > 1 - e"^"' . 
(ii) Ifp< [1 + (r-i - i)i/{i-2r)]-i ^/^g^ 

lim ^logP(Ai(G„) > nr) > -hp{r) 

and furthermore, there exist e,C > such that for all n, 

P(inf {(5n(G„,s) : < s < l} > e | Ai(G„) > nr) > 1 -e"^"'. 

Both theorems are proved through an analysis of the graphon variational problems rising from the 
framework of Chatterjee and Varadhan [8]. We show that throughout the replica symmetric region 
its unique solution is the symmetric one (a consequence of a generalized form of Holder's inequality), 
whereas elsewhere one can construct graphons that outperform the symmetric candidate. Note 



that Theorem 1.2 addresses spectral large deviations, whereas the framework of [8] was tailored 
for subgraph densities (the recent work [9] broadens it to general random matrix properties with 
respect to an appropriately defined spectral distance. Here we consider concretely large deviations 
in the spectral norm of random graphs) . Fortunately, the results of [8J easily extend to a wide family 
of graph parameters with respect to the cut-metric, including the operator norm, the extension of 
the (normalized) spectral norm to the space of graphons. This generalization is detailed in ^ 

1.2. Exponential random graphs. We now turn our attention to a different random graph 
model, the basic setting of which assigns a probability Pj3{G) to every graph G on n labeled 
vertices as a function of its edge density t{K2, G), its triangles density t{K3, G) and a weight vector 
P = P2) for these two quantities]^ Namely, the graph G appears with the following probability: 

^^^^^ = i^'^'P [(f)(l^iii^2,G) + l32t{Ks,G))^ , (1.2) 

where Z„ is a normalizing factor (the partition function). When /32 > the model favors graphs 
with more triangles whereas triangles are discouraged for (^2 < 0. There is a rich literature on both 
flavors of the model, motivated in part by applications in social networking: the reader is referred 
to [Ml ESI EZl is] as well as [21 [7] and the references therein. 

As shown by Bhamidi, Bresler and Sly [2] and Chatterjee and Diaconis [7j, when /32 > and n 
is large, a typical random graph drawn from the distribution has a trivial structure — essentially 
the same one as an Erdos-Renyi random graph with a suitable edge density. This somewhat 
disappointing conclusion accounts for some of the practical difficulties with statistical parameter 
estimation for such models. It was further shown in [7J that if we allow (^2 to be sufficiently negative, 
then the model does behave appreciably differentially from an Erdos-Renyi model. In this part of 
our work we focus on the case ^2 > 0, and propose a natural generalization that will enable the 
model to exhibit a nontrivial structure instead of the previously observed Erdos-Renyi behavior. 



^The general model allows for an arbitrary (fixed) collection of subgraphs. While the majority of our arguments 
can be extended to the general setting, we focus on the two-term model for simplicity and clarity. 
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Figure 3. The (/3i, /32)-phase diagrams for the exponential random graph model 



in (1.3) with /32 ^ and various values of a > 0, as a special case of Theorem 1.3 
When a < 2/3, symmetry breaking occurs in the shaded region (at least) and replica 
symmetry occurs for Pi > —2. When a > 2/3, replica symmetry always occurs. 

Consider the exponential random graph model which includes an additional exponent a > in 
the exponent of the triangle density term: 

1 



Zn V V 2 



{(3it{K2,G) + P2t{K3,GY 



(1.3) 



We will show that this model exhibits a symmetry breaking phase transition even when /32 > 0. 
When a > 2/3, the generalized model features the Erdos-Renyi behavior, similar to the previously 
observed case of a = 1. However, for < a < 2/3, there exist regions of values of (/3i, /32) for which 
a typical random graph drawn from this distribution has symmetry breaking. As was the case for 
Theorems |1.1| and |1.2[ rather than just triangles we prove this result for any d-regular graph H. 



Theorem 1.3. Let H be a d-regular graph for some fixed d > 2 and fix f3i £ 
be Gn be an exponential random graph on n labeled vertices with law 

PaAGn) = ^ exp f f^^ (/3lt(i^2, Gn) + p2t{H, Gn)") 



Zn 



and (^2, a > 0. Let 



(1.4) 



(a) Suppose a > d/e{H). There exists a subset P = {(/3i, (^(/3i)) : /3i < \og{e{H)a — 1) — e(fl^i-i i 
ofM? for some function (/? : M — t- M such that for every {Pi, P2) € M x (0, 00) \ T there exists 
< u* < 1 so that 5n{Gn,u*) — >• almost surely as n ^ 00, and for every (/3i,/32) G T there 
exist < ul < U2 < 1 such that min{(5n(G'n, ^i) , 6niGn,U2)} — )■ almost surely as n ^ 00. 
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Figure 4. Linear d-regular hypergraphs: a cycle {d = 2) and the Fano plane {d = 3). 

(b) Suppose < a < d/e{H) and /3i > \og{d — 1) — d/{d — 1). Then there exists < n* < 1 such 
that for every e > there is a C > such that 6a{Gn, u*) — )• almost surely as n ^ oo. 

(c) Suppose < a < d/e{H) and /3i < \og{d — 1) — d/{d — 1). Then there exists an open interval 
of values /32 > with the property that there exist e, C > such that for all n, 



(inf {6n{Gn, s) : < s < l} > e) > 1 - e 



2 



Note that, as in the previous theorems, for the replica symmetric phase one can quantify the 
rate of convergence, e.g., when 6a{Gn,u*) — )• almost surely we in fact have that for any e > 
there exists some C > so that F{6o{Gn,u*) < e) > 1 — e"*^" for every n. 



1.3. Linear hypergraphs. Theorem 5.1 (see ^ extends Theorem |1.1| to the setting of random 



hypergraphs. A A;-uniform hypergraph G consists of a set V{G) of vertices and a set E{G) of 
hyperedges, where each hyperedges is a A:-element subsets of V{G). It is said to be d-regular if 
every vertex is incident to exactly d edges, and linear if every two vertices are incident to at most 
one common hyperedge (see Fig. |4] for examples of d-regular 3-uniform linear hypergraphs). The 
random hypergraph Q^^\n,p) is formed by starting with n vertices and adding /c-element subset of 
the vertices as a hyperedge independently with probability p. 

In order to generalize our arguments to large deviations in the density of if, an arbitrary d-regular 
linear hypergraph, one must first extend the theory developed by Chatterjee and Varadhan [8j to 
A:-uniform hypergraphs. Thanks to the linearity of the hypergraph H there is a simple extension of 
Szemeredi's regularity lemma to hypergraphs that behaves well with respect to the density of H. 

1.4. Graph homomorphisms. Alon pj conjectured in 1991 that the number of independent sets 
in a d-regular graph G, denoted i(G), satisfies i{G) < i{K(i^(i)^^^^^\/^'^'^\ i.e., it is maximized when 
G is a union of complete bipartite graphs Kd^^. Kahn |3Dj verified this when G is bipartite using 
an ingenious application of the entropy method (specifically. Shearer's inequality). This result was 
thereafter extended by the second author [IB] to all d-regular graphs via an elementary bijection. 
Using the entropy method of Kahn, Galvin and Tetali |21j extended [30j to graph homomorphisms: 

Theorem 1.4 (Galvin and Tetali J21], following Kahn [SU]). Let G he a simple d-regular bipartite 
graph, and let H be a graph, possibly containing loops. We have 

liom{G,H) < hom(/Q,rf,i/)l^(^)l/(2«'). (1.5) 

Observe that this inequality generalizes the independent set result, since i{G) = hom(G, O — •). 
Previously, all the known proofs of these inequalities relied on entropy techniques. Regarding a more 
elementary proof, Kahn [30] wrote that "one would think that this simple and natural conjecture... 
would have a simple and natural proof." As a related aside, in ^we give a short new entropy- free 



proof for (1.5) as an immediate consequence of the generalized form of Holder's inequality. 
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1.5. Organization. In f|2]we review graph hmits as weU as the large deviation principle for random 
graphs developed by Chatterjee and Varadhan. In ^ we apply the machinery of Chatterjee and 
Varadhan to prove Theorems 1.1 and 1.2, determining the phase diagram for large deviations of 
subgraph densities and the largest eigenvalue in Q{n,p), resp. Section [4] focuses on exponential 
random graphs and gives the proof of Theorem |1.3[ In fjs] we extend Theorem |1.1| to densities 
of linear hypergraphs in random hypergraphs. Section [6] contains the short new proof of the 
inequalities of Kahn and Galvin-Tetali (Theorem 1.4). Finally, in ^we discuss some open problems. 



2. Graph limits and the framework of Chatterjee- Varadhan 

The theory of Chatterjee- Varadhan reduces the problem of determining the rate function for large 
deviations in dense random graphs to solving a prescribed variational problem in graph limits. We 
will review the required definitions from graph limit theory and then describe the results of [5] in 
the broader context of "nice" graph parameters, generalizing subgraph counts. 

2.1. Graph limits. Let W be the space of all bounded measurable functions [0,1]^ — M that 
are symmetric (i.e., f{x,y) = f{y,x) for all x,y £ [0,1]). Further let Wo denote all symmetric 
measurable functions [0,1]^ — )■ [0,1], referred to as graphons or kernels (occasionally these are 
called labeled graphons since later we consider equivalence classes of Wo modulo measure-preserving 
bijections on [0, 1]). Lovasz and Szegedy [33] showed that the elements of Wo are limit objects for 
sequences of graphs w.r.t. all subgraph densities. Specifically, for any / G W and any simple graph 
H with V{H) = [m] ={1,2,..., m}, define 



t{H,f)= / TT f{xi,Xj)dxi---dxm. 



ii,j)&E{H) 

(We shall omit the domain of integration when there is no ambiguity.) Any simple graph G on 
vertices {1, 2, . . . , n} can be represented as a graphon by 

.Q, N J 1 if ([nx] , [ny]) is an edge of G, 
/ (X, y) = \ 

I otherwise. 

In particular, t{H, G) = t{H, f'^) for any two graphs H and G. 

A sequence of graphs {Gn}n>i ^^^^ *° converge if the sequence of subgraph densities t(^H, Gn) 
converges for every fixed finite simple graph H. It was shown in |33] that for any such convergent 
graph sequence there is a limit object / G Wo such that t{H, Gn) — )■ t{H, f) for every fixed H. 
Conversely, any / G Wq arises as a limit of a convergent graph sequence. 

We will consider several norms on W, beginning with the standard norm 

\ i/p 
\f{x,y)f dxdy] 

Each / G W can be viewed as a Hilbert-Schmidt kernel operator Tj on I/^([0, 1]) by 
{Tfu){x) = [ f{x, y)u{y) dy for any u G ^^([0, 1]), 

JO 

and the operator norm for / is then given by 

ll/ll^p := min {c > : ||r/ii||2 < c WuW^ for all u G ^^([0, 1])} . 
As Tf is self-adjoint, its operator norm is equal to its spectral radius (see, e.g., [lOl Thm. 12.25]). 
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The cut-norm on W is given by 



□ := sup 

5,TC[0,1] 



f{x,y) dxdy 
SxT 



sup 

[0,1]^[0,1] 



f{x,y)u{x)v{y) dxdy 

[0,1]2 



where the two suprema are equal since one only needs to consider {0, l}-valued u and v by the 
linearity of the integral. The second definition is useful for giving upper bounds using the cut-norm. 

For any measure-preserving map a: [0,1] — ?■ [0,1] and / G W, define /"^ G W to be given by 
f'^{x,y) = f{a{x),a{y)). We define the cut-distance on W by 

fa(/,5) = inf||/-5'^||n 

where a ranges over all measure- preserving bijections on [0, 1]. For the case of graphons this gives a 
pseudometric space (Wo, fa), which can be turned into a genuine metric space Wo, equipped with 
the same cut-metric, by taking a quotient w.r.t. the equivalence relation / ~ g iff doif, g) = 0. The 
following theorem can be viewed as a topological interpretation of Szemeredi's regularity lemma. 

Theorem 2.1 ([33j). The metric space (Wo, fa) is compact. 

It was shown in [4j that a sequence of graphs {Gn}n>i converges in the sense of subgraph 
densities if and only if the sequence of graphons f^^ S Wo converge in Wq w.r.t. the cut-distance. 
Equivalently, the topology on Wo induced by fa is the weakest topology that is continuous w.r.t. 
the subgraph densities t{H, •) for every H. This underlines one of the reasons making the cut-metric 
topology a natural choice for the space of graphons. 

2.2. Graph parameters in the cut-metric topology. We shall focus on graph parameters 
whose extensions to the space of graphons behave well under the cut-metric topology. One example 
of such a graph parameter is the subgraph density t{H, •) for an arbitrary finite simple graph 
which was defined in ^2.1 directly on the full space of graphons such that t{H,G) = t{H,f'^) for 



any graph G. A crucial feature of t{H, •) is being continuous w.r.t. the cut-metric ([!]), related to 
the existence of a "counting lemma" in the regularity lemma literature. This will be a prerequisite 
for applying the large deviations machinery of Chatterjee and Varadhan. 

Definition 2.2. A nice graph parameter is a function r: Wo — M that is continuous w.r.t. fa and 
such that every local extrema of r w.r.t. L°°(Wo) is necessarily a global extrema. We extend such 
a function r to Wo in the obvious manner and further write r(G) = t(/*^) for any graph G. 

Another way to state the local extrema condition is that if / G Wo is not a global maximum 
(resp. minimum) of r then for every e > there exists g S Wo with ||/ — g\\^ < e and T{g) > t(/) 
(resp. T(g) < t(/)). This technical condition will later imply the continuity of the rate function. 

Since the metric space (Wo, fa) is compact and path-connected, the image of r as above is a 
finite closed interval. In particular, its maximum is attained by a non-empty closed subset of Wq. 

Example 2.3 (Subgraph density). For any fixed finite simple graph H, the subgraph density 
t{H, •) is a nice graph parameter. As mentioned above, t{H, ■) is continuous w.r.t. fa and in 
fact the map / i— )• t{H,f) is Lipschitz-continuous in the metric fa ([H Theorem 3.7]). The local 
extrema condition is fulfilled since the function g~^ = min{/ + e,l} satisfies t{H,g~^) > t{H,f) 
unless t{H, f) = 1, and similarly g~ = (1 — e)f has t{H,g~) < t{H, f) unless t{H, f) = 0. 



The next two examples are of graph parameters that do not meet the criteria of Definition 



2.2 
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Example 2.4 (Probenius norm). Let r be the function that maps a weighted graph G on n vertices 
with adjacency matrix Aq to the normahzed Frobenius norm ||AG||p/n. Then r is discontinuous 
in (Wo, 5n) and therefore is not nice. Indeed, fix < p < 1 and let G„ ~ Q{n,p). The sequence Gn 
is known to converge in Wq almost surely to the constant graphon p (see Theorem 4.5]), whose 
Frobenius norm is p. In contrast to this limiting value, ||^g„ |If ~ ||/^" II 2 ~^ Vp almost surely. 

Example 2.5 (Max-cut). The function r(G) = maxcut(G)/ |y(G)|^ extends to Wo via 

t(/) = sup / fix,y) dxdy. 

UC[0,1] JUx{[0,l]\U) 

Despite being continuous w.r.t. 5a as well as monotone, the max-cut density is not nice. The 
continuity of r follows from the fact |t(/) — T{g)\ < dnif, 9), as any cut for either f or g translates 
into a cut for the other with value differing by at most 6c\{f,g). To see that r does not satisfy the 
local maxima condition, let / be the graphon defined to be 1 on [0, |] x [|, 1] U [|, 1] x [0, |] and 
elsewhere. We have r(/) = |, induced hy U = [0, This is not the global maximum for r, which 
is ^ for the constant function 1. However, we claim that / is a local maximizer of r with respect to 
the L°°-topology. By monotonicity, showing that T{g) = | for the function g = min { / + 5,1} will 
imply that r(/o) < T{g) = T{f ) for any /o with ||/o - /||^ < i. Indeed, if fi{U D [0, i]) = a and 
fi{U n [|, 1]) = b, where is Lebesgue measure, then the cut density induced by U for g is equal to 
^a(| — a) + 26(1 — b) + a(| — b) + 6(| — a), which is maximized at (a, b) = (0, |) and (a, b) = (|, 0). 



We will see in ^3.2 (see Lemma 3.6) that, in contrast to the Frobenius norm, the spectral norm 
(the focus of Theorem |1.2[ ) does behave well under the cut-metric topology, thus qualifying for an 
application of the large deviation theory of Chatterjee and Varadhan. 

2.3. Large deviations for random graphs. A random graph Gn ~ Q{n,p) corresponds to the 
random point Z*^" G Wq, thus Q{n,p) induces a probability distribution Fn,p on Wo supported on 



a finite set of points (graphs on n vertices). Recalling (1.1), we extend hp-. [0, 1] — )• M to Wo by 



hif) ■= / hp{f{x,y)) dxdy for any / € Wo. 

i[0,l]2 

An important feature of hp is that it is a convex function on [0, 1] and hence lower-semicontinuous 
on Wq with respect to the cut-metric topology ([U Lem. 2.1]). 

Using Szemeredi's regularity lemma as well as tools from graph limits, Chatterjee and Varad- 
han [8] proved the following large deviation principle for random graphs. 

Theorem 2.6 ([8]). For each fixed p G (0,1), the sequence lPn,p obeys a, large deviation principle 
in the .space (Wo, 5n) with rate function hp. Explicitly, for any closed set F C Wo, 

lim sup logE„,p(F) < - inf /ip(/) , 

n— s>oo [^2) 



and for any open U C Wo, 



liminf ^logP„,p(C/) > - inf hp{f) 



The machinery developed by Chatterjee and Varadhan reduces the problem determining the large 
deviation rate function for dense random graphs to solving a variational problem on graphons. For 
any nice graph parameter r: Wo — )• M, any p G [0, 1], and any t S r(Wo), let 

(Ar(p, t) := inf {hp{f) -.feWo, T{f) > t] . (2.1) 



Since hp is lower-semicontinuous on Wo, the infimum in (2.1) is always attained 
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The following result was stated in [HI Thm 4.1 and Prop. 4.2] for the graph parameter r = t{K: 



3, 



We state its generalization to the class of nice graph parameters as per Definition 2.2 



Theorem 2.7 (Variational problem). Let r : Wo — )• M 6e a nice graph parameter and Gn ~ G{n,p). 



Fix p G (0, 1) and t < max(T). Let 4>r{p,t) denote the solution to (2.1). Then 



lim ^logP(r(G„)>t) = -0,(p,t). (2.2) 
n^oo (2) 



Let F* be the set of minimizers for ( |2.1[ ) and let F* he its image in Wq. Then F* is a non-empty 
compact subset 0/ Wq. Moreover, for each e > there exists C = C{T,£,p,t) > so that for all n, 

^6n{Gn, F*)<e r(G„) > > 1 - e"^"' . 



In particular, if F* = {/*} for some f* E Wo then the conditional distribution of Gn given the 
event T{Gn) > t converges to the point mass at f* as n —)• 00. 

Observe that by considering — r (also a nice graph parameter) one obtains the same result for 



lower tail deviations. The intuition behind the second part of Theorem |2.7| is that the pr obab ility 

and 



that Sn{Gn, F*) > e conditioned on T{Gn) > t can be again computed using Theorem 
shown to be exponentially smaller than that of the probability of the event t(G„) > t. 



2.6 



The proof of Theorem 2.7 is a straightforward extension of the arguments of [8, Thm. 4.1] to 



nice graph parameters. A technical condition needed to complete the proof of Theorem 2.7 is given 
by the following lemma, key to which are the attributes of a nice graph parameter. 

Lemma 2.8. Let r: Wo — t- M 6e a nice graph parameter. For any p £ (0, 1), the map 1 1— (f)r{p,t) 
is continuous on r(Wo). 

Proof. Fix < p < 1. By definition, t 1— )• (f)r{p,t) is non- decreasing. To prove left-continuity, 
consider a € M. Since hp is lower-semicontinuous on Wo, the set {/ : hp{f) < a} is closed in Wo, thus 
also compact by the compactness of Wq. Since r is continuous on Wo, the set {t(/) : hp{f) < a} is 
compact and in particular closed. Note that the latter is precisely the pre-image of (—00, a] under 
the inverse of 1 1— )• <j)rip,t). As this pre-image is closed for any a S M, left-continuity follows. 

In order to prove right-continuity it suffices to show that for every S t(Wo) with to < max(T) 
and every e > there exists some / G Wq such that hp{f) < 4>r{p,tQ) + e and t(/) > to- 
Indeed, since hp-. [0, 1] — )• M is uniformly continuous (for any fixed p), there exists e' > so that 



\hp{x) — hp{y)\ < £ whenever jrr — y] < e' . Let /o be the minimizer of the variational problem (2.1) 
for t = tQ. The local extrema condition in Definition |2.2| now implies that there is some / G Wq 
with r(/) > to and ]]/ - fo\\^ < e' . Hence, hp{f) < hp{fo) + £ = (l)r{p,to) + as desired. ■ 

3. The phase diagram for subgraph densities and the spectral radius 

3.1. Subgraph density. In this section we prove Theorem |1.1[ characterizing the phase diagram 
of upper tails (replica symmetry vs. symmetry breaking) of the density of a fixed d-regular subgraph. 

Establishing the replica symmetric phase will hinge on a generalized form of Holder's inequality 
which appeared in [18]. We include its short proof for completeness. 

Theorem 3.1 (Generalized Holder's inequality). Let /ii, . . . , /i^ be measures on . • . , resp., 
and let fi = Y\i=i t^'i be the product measure on Q = YVi=i ^i- -^^t Ai, . . . , Am be nonempty subsets 
of [n] = {1, . . . ,n} and write i^A = Il^eA ^''^d ha = Yli(^A A*^- /« ^ i^Ai, fJ'Ai) with Pi > 1 
for each i G [m] and suppose in addition that X^j.£g^.(l/Pi) < 1 for each £ G [n]. Then 



/ //t lit, / /" 

Hm di^<Il[ ^/^^^ 

i=l 1=1 

In particular, when pi = d for every i G [m] we have JYYi^i \ fi\ ^ H (/ \ fi\^ dfiAi)^^'^- 
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Proof. The proof carries by induction on n with the trivial base case of n = 0. By Fubini's theorem, 

m 

"" ' '["-1] 



/ fim df^= [ u \fi\ n 1/^1 ^/^= / ( / n i/^i ^'"") n i/^i ^^[« 



where the argument of each /j is the restriction of x G 17 to the coordinates of Ai, denoted by XAi- 
Holder's inequality (along with Jensen's inequality if X]rnGAj(^/^'*) ^^^^ than 1) implies that 

/ n i/^i ^^"^ n ( / i/^i"" ^/^") ^"'^ 

•'^^ i-.n&Ai i:n<^A, ^ ''^^ ^ 

/ \ 1 /pi 

thus for each i with n G we can let /* : ^[n-i] — )■ M denote the averaging map ( l/iT' d^n ] 
and obtain that 

/ ni/*i ^/^^ / n n \f^\ ^-"["-n- 

Now, the functions /* correspond to A* = Ai \ {n} and therefore, thanks to the assumption that 
X^i-^eAi — ^ each £ G [n — 1] we can apply the induction hypothesis and infer that 



Wm d,< n ( / (f:r ^/^[n^ n 



i:n^A, ^-^^["-11 



as required. 



It is helpful to compare Theorem 3.1 with the standard Holder's inequality for the case where 
Pi = d for all i. A direct application of Holder's inequality produces the inequality ||n/«lli — 
rij ll/i|lm' whereas Theorem 3.1 exploits the extra assumption that : i £ Ai} < d for all £ G [n] 
and gives the stronger inequality \\W fi\\-^ < f^- For instance, 



Placing this in the context of subgraph densities, as an immediate corollary of Theorem 3.1 (in the 
special case that Pi = d for all i) we have the following inequality. 

Corollary 3.2. Let H be a graph whose maximum degree is at most d, and let f G W. Then 

t{H,f)<\\f\\f''K 



Recall that Theorem 2.7 reduces the problem of finding the phase boundary to determining 
whether the constant function r is a solution for the variational problem of minimizing hp{f) over 
/ G Wo subject to t{H, f) > r<^\ where H is some fixed graph. In light of the above corollary, it 
is important to estimate hp{f) for functions / G Wq with ||/||^ = r, as addressed next. 

Lemma 3.3. Let < p < 1 and let f G Wq. Suppose that d > 1 and < r < 1 are such that the 
point {r^,hp{r)) lies on the convex minorant of x i— hp{x^/^). If in addition either 

(a) p < r < 1 and ||/||^ > r, or 

(b) 0<r<p and \\f\\^ < r, 

then hp{f) > hp{r), with equality occurring if and only if f = r. 
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Figure 5. The construction in Lemma |3.4[ 

Proof. Let ■il){x) = hp{x^/'^) and let ■0 be the convex minorant of By Jensen's inequahty, 

hp{f) = I i^U{x,yf)dxdy> j ^{f{x,yf) dxdy>^{^j f{x,y)^ dxdy^ = (||/||^) = /ip 

The fact that hp{x) is decreasing along [0,p] and increasing along [p, 1] (see ^A| ) implies that under 
either of the assumptions in Part Q and Part (|b]) we have /ip(||/||rf) > hp{r), hence hp{f) > hp{r). 
Since ■0 is not linear in any neighborhood of r*^, equality can occur if and only if / = r. ■ 



The final element needed for the proof of Theorem 1.1 is a construction that outperforms the 
constant graphon in the symmetry breaking regime. This is achieved by the following lemma. 

Lemma 3.4. Let H be a d-regular graph. Fix 0<p<r<lso that (r"', hp{r)) is not on the convex 
minorant of x ^ hp{x^/'^). Then there exists f G Wo with t{H,f) > r'^W and hp{f) < hp{r). 

Proof. Since (r'^, hp{r)) does not lie on the convex minorant of x i— )• hp{x^^'^), there necessarily exist 
0<ri<r<r2<l such that the point (r*^, hp{r)) lies strictly above the line segment joining 
{rf,hp{ri)) and {r2, hp{r2)). Letting s be such that 



r'i = srf + {1 - s)ri , 



we therefore have 
Let e > and define 



shp{ri) + (1 - s)hp{r2) < hp{r) . (3.1) 



a = se"^ , b= (1- s)e^ + , 
Jo = [a, 1-6], /i = [0,a], /a = [1 - 6, 1] , 
noting that for a < 1 — b for any sufficiently small e. Define fs G Wo by 

Vi if (x,y) G (/o X /i)U (/i X /q), 
fe{x, y)= lr2 if (x, y) G (Jo x h) U {h x Jq) , 
r otherwise . 

v 

(See Fig. [5] for an illustration of this construction.) We claim that 

t{H, fe) - r<^^ = v{H) (a(rf - r'^) + 6(r^ - r"^)) r'^W"'^ + 0{e^) . (3.2) 

Indeed, the only embeddings of H that have values different from r^^^^ are those where at least one 
vertex of H is mapped to Ji U I2. Since a and b are both O(e^), in order to compute t{H, fs) — r^^^^ 
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up to an O(e^) error we need only consider embeddings of H where precisely one vertex gets mapped 
to /i U/2. Denote this vertex of H by u, and observe that if u is mapped to Ii then the contribution 



iot{HJ,) 
contribution is 



is {rf 



j,d\^e{H)—d 



since H is d-regular. Similarly, if u is mapped to I2 then the 



j,d\^e{H)—d 



Putting everything together yields (3.2). 



By definition of a, 6 and s we have 



a r 



r'^) + bi4 - r'^) = se\rf - /) + ((1 - s)e^ + e^){4 



Recalling that r2 > r and plugging the last equation in ( |3.2[ ) it now follows that t{H, fs 
for any sufficiently small e > 0. At the same time, we also have 



> r 



e(H) 



hpife) - hp{r) 



2a(l -a- 6)(/ip(ri) - hpir)) + 26(1 -a- b){hp{r2) - hp{r)) 
2(1 - a - b) {ahpin) + bhp{r2) - (a + b)hp{r)) 

2(1 - a - {shp{n) + (1 - s)hp{r2) - hpir) + (/ip(r2) - hp{r))e) . 



Revisiting (3.1) we conclude that hpife) < hp(r) for any sufficiently small e > 0. 



We now have all the ingredients needed for establishing the phase diagram of upper tail deviations 
for subgraph densities. 



Proof of Theorem II. 1[ For Part by applying Theorem 2.7 to the graph parameter t{H, •), 
it suffices to show that the constant function r is the unique element / G Wo minimizing hp{f) 
subject to t{H,F) > r'^W. Indeed, by Corollary [3^ t{H,F) > r<=W implies that ||/||^ > r, and 
by Lemma |3.3| Part ([a|), hp{f) > hp{r) with equality if and only if / is the constant function r. 

To prove Part let F* C Wq be the set of minimizers for the variational problem (|2.1l) with 
the graph parameter t{H,-). Then F* does not contain the constant function r by Lemma |3.4[ 
nor does it contain any constant function of value r' 7^ r (when r' > r one has hp[r') > hp{r), 
whereas if r' < r then t{H,f) < r^^^^). Let C C Wo be the set of constant graphons. Since F* 
and C ar e dis joint and both are compact, 5^{F* ,C) > 0. The desired result follows from applying 
"i^lwith e = 6n{F*,C)/2. 



Theorem 
When d 



with e 

2, the phase boundary is explicitly given by Lemma A. 2 thus concluding the proof. 



One can also ask what the phase diagram is for lower tail deviations of subgraph densities. We 
next show that for certain bipartite graphs there is replica symmetry everywhere for lower tails. 

A beautiful conjecture of Erdos and Simonovits |jt2j and Sidorenko (from here on referred 
to as Sidorenko's conjecture) states that every bipartite graph H satisfies t{H,G) > t{K2,G)'^^^^ 
for every graph G. The conjecture was verified for various graphs H (e.g., trees, even cycles |41j . 
hypercubes [25], bipartite graphs with one vertex complete to the other part [H]). As it turns out, 
for any such graph H the lower tail deviations are always replica symmetric (no phase transition). 

Proposition 3.5. Fix < r < p < 1, let Gn ~ G{n,p) be the Erdds-Renyi random graph and let 
H be a fixed bipartite graph for which Sidorenko 's conjecture holds. Then 

1 



lim 

n— >oo 



\ogF (tiH,Gn) <r 



e{H) 



-hp{r) 



and furthermore, for every e > there exists some constant G = C{H,£,p,r) > such that 



,r) < e 



tin, Gn) < r 



e{H) 



> 1 



Proof. Applying Theorem 2.7 with —t{H, •), it suffices to show that the constant function r is the 
unique element / G Wo minimizing hp{f) subject to t{H, f) < r<^\ Since H satisfies Sidorenko's 
conjecture, t{H,f) > \\f\\f^^ for all / G Wo. Thus, if t{H,f) < r^(^) then ||/||, < r, and so by 
Lemma 



3.3 



Part (|b]) (applied to the case d = 1, noting that then hp{x^/'^) is itself convex) we have 



hp{f) > hp{r) with equality if and only if / is the constant function r. 
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3.2. Largest eigenvalue. In this section we prove Theorem 1.2, addressing the phase boundary 
for large deviations in the spectral norm. The proof will follow a similar route as the previous 
section, yet first we must show that the spectral norm is a nice graph parameter. 

For a graph G on n vertices, let Ai(G) be the largest eigenvalue of its adjacency matrix Aq. Since 
Ag is symmetric, Ai(G) = H^cllop) ^^^d therefore over W we have H/'^H^p > Ai(G)/n. It is easy to 
verify (see Lemma 3.6 below) that in fact ||/'^|| = Xi{G)/n, thus the operator norm on W is the 
graphon extension of the (normalized) largest eigenvalue. Furthermore, as we show below, is 
(uniformly) continuous w.r.t. the cut-metric, and the local extrema condition in Definition 2.2 is 
satisfied as well, thus |Hlop is a nice graph parameter. 

Lemma 3.6. The function ||-||op is a continuous extension of the normalized graph spectral norm, 
i.e., Xi{G)/n for a graph G on n vertices, to (Wo, fa). Moreover, \\-\\^p is a nice graph parameter. 

Proof. We first show that ||/||op = \i{G)/n for any graph G on n vertices. Clearly, the largest 
eigenvector of Aq, the adjacency matrix of G, can be turned into a step function u: [0, 1] — t- M such 
that Tjgu = (Ai(G)/n)u, and so ||/'^||qp > Ai(G)/n. Conversely, for any u: [0, 1] — )• M we consider 

the step function Un : [0, 1] — )• M such that for any 1 < i < n, on the interval (^, ^] it is equal to 
the average of u over that interval. Let v E M" be the vector of values of n„. Since f^ is constant 
in every box (^, ^] x (i^, i], we have 

llT/nllg = ||r/u„||2 = 11^0^^112/"'^ < ^i{G) \\v\\2/'n'^ = Ai(G) ||u„||2/n < Ai(G) 1114112 /n, 

where the last inequality is due to convexity. It follows that ||/||op = Ai (G)/n. 
Next, we will argue that 

ll/llop ^ 4 ll/llg for any symmetric measurable /: [0, 1]^ — )• [—1, 1]. (3-3) 
Let u G -i^^([0, 1]) with ||n||2 = 1. By Cauchy-Schwarz we can infer that 

f{x,y)u{y) dy] dx] = ( [ f{x,y)f{x,y')u{y)u{y') dxdydy'^ 




f{x,y)f{x,y') dx] dydy I u{y) u{y') dydy 




2 

/^2 



f{x, y)f{x, y')f(x', y)f{x', y') dxdx dydy' . 
For any fixed x' ,y' we can let Vy'{x) = f{x,y') and w^'iy) = f{x',y), thus rewriting the above as 

f{x,y)vyi{x)wxi{y) dxdy^ f{x',y') dx'dy' < 4||/||n , 
with the last inequality justified by the fact that for any g and v,w: [0, 1] — ?■ [—1, 1] we have 



I J g{x,y)v{x)w{y) dxdy\ < 4 by the definition of the cut-norm, thereby establishing Eq. (3.3). 
(The factor of 4 above was due to splitting v, w into positive and negative parts. Indeed, for 
/: [0, 1]^ — )■ [0, 1] the bound in (3.3) remains valid without the factor of 4 in the right-hand side.) 



Consider /, 5 € Wo and let a vary over all measure-preserving bijections on [0, 1]. We then have 



op 1 1 5 1 1 op 



< inf 11/ - ^niop < ^/2inf 11/ - ffHIn^' = 9?'' 



thus implying that ||-||op is uniformly continuous in (Wo,fa) 



Finally, we need to verify the local extrema condition in Definition 2.2 Let / G Wo- There are 
no local non-global minima since ||(1 - e)/||op = {l-e) WfW^^ < ||/||op unless ||/||^p = already. In 
addition, we claim that g = min{/ + e, 1} satisfies H^Hop > ||/||op unless ||/||op = 1 already. Indeed, 
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take some u S -^^^([0, 1]) which is nonzero (a.e.) and satisfies u > and TfU = \\f\\^p u. It suffices to 
show that TgU > TfU on some subset of [0, 1] with positive measure. Let A = {x £ [0, 1] : u{x) > 0} 
be the support of u, which by our choice has positive Lebesgue measure n{A) > 0. If n{A) < 1 
then TfU = u = (a.e.) on A" := [0, 1] \ A, so that f = on A"" x A. Hence, g = e on A"" x A 
and TgU = e \\u\\i > = Tju on A'^, as desired. Suppose therefore that IJ,{A) = 1. If ||/||op < 1 we 
must have g > f on a subset of positive measure, and hence also TgU > TfU on a subset of positive 
measure, as u has full support. This shows that / cannot be a local maximum unless H/H^p = 1. ■ 



In light of the above lemma, the variational problem under consideration in Theorem 1.2 becomes 

inf{/ip(/):/GWo,||/Lp>r}. (3.4) 
We will need the following straightforward inequality relating the operator norm, ||-||-^ and ||-||2- 
Lemma 3.7. For every f eWo we have \\f\\^ < ||/||„p < ||/||2. 
Proof. For the left inequality, observe that since / > we have that 

ii/iii = r/iiii<r/iii2<ii/iiop- 

For the right inequality in the statement of the lemma, let u: [0, 1] — t- M. By Cauchy-Schwarz, 

11^/^112 ^ / (/ f^^^y)^^y) '^^ - {^j fi^^yf dydx^ (^j u{yf dy^ = \\f\\l \\u\\l , 

and therefore ||/||op < II/II2) claimed. ■ 

The following lemma is the operator norm analogue of Lemma 3.4 , providing a construction that 
beats the constant graphon in the symmetry breaking regime. 

Lemma 3.8. Let < p < r < 1 be such that {r'^,hp{r)) does not lie on the convex minorant of 
X I— )• hp{y/x). Then there exists some f £ Wo with ||/||op > r and hp{f) < hp{r). 

Proof. Let e > 0. Let fe be the construction from the proof of Lemma [3.4| with d = 2, and define the 
parameters of that construction ri, r2, s, a, b, Lq, Ii, I2 as given there. Having already demonstrated 
that hp{fe) < hp{r) for any small enough e > 0, it remains to show that ||/e||op > r. To this end, 
it suffices to exhibit a function u G -Z>^([0, 1]) such that {Tf^u){x) > ru{x) for all x G [0, 1]. Let 

(1 — a — b)ri \i X £ Ii , 

^(^) = {{^ — — b)r2 if X G I2 , 
r if X £ Iq . 

Recall that fs{x,y) is r except when {x,y) G (/q x /j) U (Ii x Iq) where it is rj for i = 1,2. It now 
follows that for every x £ Ii = [0, a] 

{Tf^u){x) = a(l — a — b)rir + 6(1 — a — b)r2r + (1 — a — b)rir > (1 — a — b)rir = ru{x) . 
Similarly, for any xG/2 = [l — 6,l]we have 

(Tf^u){x) > (1 — a — 6)r2r = ru{x) . 
Finally, if x G lo = [a, 1 — b] then 

{Tf^u){x) = a{l-a- b)rj + b{l - a - b)rl + (1 - a - b)r'^ = {I - a - b){r'^ + arf + 6r|) . 
Plugging in the facts that = srf + (1 — s)r2 while a = se^ and 6 = (1 — s)e^ + e^, we get that 



{Tf^u){x) = (1 - _ £3)(^2 ^ ^2^2 ^ ^2^3) ^ ^2 ^ (^2 _ ^2^^3 ^ ^(^4^ ^ ^2 

where the strict inequality is valid for any sufficiently small e > since r2 > r. 
Altogether, {Tf^u){x) > ru{x) for all x G [0, 1] and so ||/||op > r, as required. 



ru[x) 
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2.7 



applied to the graph parameter 



op 



Proof of Theorem II. 2[ To prove Part ^ , by Theorem 
it suffices to show that the constant function r is the unique element / S Wo minimizing hp(f) 

Part Q and 



subject to 



op 



> r. Indeed, by Lemma 



3.7 



> 



op 



> r. By Lemma 



3.3 



Lemma A.2| we know that hp{f) > hp{r), with equality if and only if / is the constant function r. 

For Part ([n]), similar to the proof of Part (|n|of Theorem [lT] in ^3.1 Lemma 3.8 implies that the 
set of minimizers of the variational problem (2.1 ) is disjoint from the set of constant graphons. We 



then apply Theorem 2.7 to conclude the proof, with the phase boundary given by Lemma A. 2 



The behavior of the lower tails deviations in the spectral norm is similar to that of the subgraph 



densities in Proposition 3.5, where replica symmetry is exhibited everywhere (no phase transition). 



Proposition 3.9. Let < r < p < 1. Let Gn ~ G{n,p) be the Erdos-Renyi random graph and let 
Ai(G„) denote the largest eigenvalue of its adjacency matrix. Then 



lim ^logP(Ai(G„) < r) = -hp{r) 

and furthermore, for every e > there is some C = C{e,p,r) > such that for all n, 

W{5u{Gn,r) < e \ Ai(G„) < nr) > 1 - e"^"' . 

it suffices to show that the constant function r is 
< r. By Lemma 



Proof. Applying Theorem 



2.7 



with r 



lop' 



the unique element / S VVq minimizing hp{f) subject to 



with 



op 



< r then 



op 



< 



op 



< r. It now follows from Lemma 



3.3 



|3.7| if / £ Wo 
Part ([b]) (used with d = 1 



bearing in mind that hp{x) is convex) that hp{f) > hp{r) with equality if and only if / 



r. 



4. Exponential random graph models 

Let us review the tools developed by Chatterjee and Diaconis to analyze exponential random 
graphs. Define 

h{x) := X log X + (1 — x) log(l — x) for x G [0, 1] , 
and for any graphon / G Wq let 



h{f):= [ h{f{x,y))dxdy. 
i[0,l]2 



The following result from [7, Thm. 3.1 and Thm. 3.2] reduces the analysis of the exponential random 
graph model in the large n limit to a variational problem. It was proven with the help of the theory 
developed by Chatterjee and Varadhan [8] for large deviations in random graphs. 

Theorem 4.1 (Chatterjee and Diaconis [7J). Let r: Wo — )• M 6e a bounded continuous function. 
Let Zn = exp ((2)t(G)) where the sum is taken over all 2(2) simple graphs G on n labeled 
vertices. Let ipn = {2) ^log-^n- Then 



ijj := lim ipn 



sup (t(/) - h{f )) 

fern 



(4.1) 



and the set F* C Wo of maximizers of this variational problem is nonempty and compact. 

Let Gn be a random graph on n vertices drawn from the exponential random graph model defined 
by T, i.e., with distribution eiqp ((2)''"("))- Then for every 77 > there exists C = G{T,r]) > 
such that for all n, 

FiSoiGn,F*)>rj))<e-'^^\ 
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We say that the exponential random graph model has replica symmetry if the set of maximizers 
F* for the variational problem t(/) — h{f) contains only constant functions, and we say that it has 



replica symmetry breaking if no constant function is a maximizer. Intuitively, Theorem 4.1 implies 
that when there is replica symmetry, for large n, the random graph behaves like an Erdos-Renyi 
random graph (or a mixture of Erdos-Renyi random graphs) , while this is not the case when there 
is broken symmetry. More precisely we have the following result (see [Ti Thm. 6.2]). 



Corollary 4.2. Continuing with Theorem 4-1 Let C C Wo be the set of constant functions. If 

F*nc 



then there exist C, e > such that for all n, 

5n{Gn,C)>e) > 1- 



To prove Theorem 1.3 using the above tools, we need to analyze the following variational problem: 

sup {mK2, f) + fT - Kf)) . (4.2) 

/eWo 



Here is the main result of this section, from which Theorem 1.3 follows by the results above. 
Theorem 4.3. Let H be a d-regular graph (d>2) and fix /3i G R and a, > 0. Let £ denote the 



corresponding exponential random graph model on n labeled vertices as specified in (1.4). 

(a) If a > d/e{H), then £ has replica symmetry. Moreover, there exists a set T cM? of the form 



{(/3i,(^(/3i)) <log(e(i?)a-l) 



e{H)a 
e{H)a- 



lie 



for some function (p : 



such that when (/3i, (32) E M x (0, cxd) \ F the set of maximizers of the variational problem (4.2) 
is a single constant function, and when (/3i,/32) S T the set of maximizers consists of exactly 
two distinct constant functions. 

(b) IfO<a< d/e{H) and /3i > \og{d— 1) — d/{d— 1) then £ has replica symmetry. Moreover, the 
variational problem ( |4.2[ ) is maximized by a unique constant function. 

(c) If < a < d/e{H) and (3i < log{d — 1) — d/{d — 1) then there exists an open interval of 
values /32 > for which £ has broken symmetry, i.e., the set of maximizers of the variational 



problem (4.2) does not contain any constant function. Furthermore, this open interval can be 
taken to be with ^2 = u^-^^^'^h'^iu) /{e{H)a) and^^ = u^'^^^^'^h'^iu) / {e{H)a) , where 

{yf^, hp{u)) and {u^, hp{u)) are the two points where the lower common tangent of x ^ hp{x^/'^) 
touches the curve forp = l/(l + e-^i). 



When restricted only to constant functions in Wo, the variational problem (4.2) becomes the 
one-dimensional optimization problem 



sup {PlU + P2U 
0<u<l 



e(H)a 



h{u)). 



(4.3) 



Note that for both (4.2) and (4.3) the supremum is in fact a maximum due to compactness. Let u* 



be the maximizer for (4.3). When there is replica symmetry, the maximum values attained in (4.2) 



and (4.3) are equal, and the exponential random graph behaves like an Erdos-Renyi random graph 
with edge density u* . It is possible that there are two distinct maximizers u* , in which case the 
model behaves like a (possibly trivial) distribution over two separate Erdos-Renyi models. In the 
work of Chatterjee and Diaconis [7J, where the a = 1 case was considered, it was shown that u* as 
a function of (/3i, /32) experiences a discontinuity across a curve in the parameter space. Radin and 
Yin [38] later showed that (when a = 1) the limiting free energy density ip from (4.1 ), as a function 
in the parameter space (/3i,/32), is analytic except on a first order phase transition curve ending in 
a critical point with second order phase transition. (See Fig. [s] in ^ for a plot of the location of 
the discontinuity in the (/3i, /32)-phase diagram.) 

Here we focus less on the discontinuity of u* and more on symmetry breaking. Nevertheless we 
shall start our analysis by giving a simple geometric interpretation of the discontinuity of u* . 
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Figure 6. Discontinuity in the symmetric solution u* due to the geometry of x i— >■ hp{x^/^^^^^"^). 



By definition, 



Setting 



hp{x) 



h{x) — X log 



P 



1-p 



log(l - p) . 



1 



1 + e- 



log we absorb the linear term in (4.2) and (4.3) into the entropy term, at which 



so that Pi 

point these two optimization problems respectively become 

hpif) - log(l 



P)) 



and 



By a change of variables u 



sup {(32U^^"^' 
0<u<l 



hp{u) 



log(l -p)) . 

in (4.5) we get the equivalent optimization problem 



sup [I32X 
0<x<l 



/ip(xV(^(^)"))-log(l-p)) 



(4.4) 



(4.5) 



(4.6) 



Observe that x = x* maximizes (4.6) iff the tangent to the curve defined by x i— )• hp{x^ / ^^^^'^°'^) at 
X = X* has slope (32 and lies below the curve. Thanks to Lemma A.l from the appendix, we know 
that X I— >• hp{x^/^) is convex if < 7 < 1 or if 



7 > 1 and p > ^0(7) : = 



7-1 



(4.7) 



7 - 1 + eT/(')'-i) ' 

Otherwise, hp{x^^"') has exactly two inflection points to the right of x = p^, so that the curve starts 
convex, becomes concave, and finally turns convex again. In addition, hp{x^/^) has an infinite 
slope at both endpoints. For any /32 G M, there is a unique lower tangent of slope /?2 to the curve 
hp{x^/^'^^^'>^^), touching the curve at x = x* = {u*Y'^^^°'. As (32 varies, u* increases continuously 
with /32, except in the situation where the curve of hp{x^^^'^^^^"^) is not convex and (32 is the slope 
of the unique lower tangent that touches the curve at two points. In that case, (4.5) is optimized 



at two distinct values of u, denoted byO<M<ti<l, and as (32 increases through this critical 
point, u* jumps over the interval {u,u) corresponding to the part of the curve lying above the 
convex minorant, then increases continuously afterwards. When hp{x^^^^^^')"^) is convex, this jump 
does not occur. See Fig. [5] for an illustration of this process (the function /ip(x^/^'^^^^'^^) is plotted 
not-to-scale in order to highlight its features) . The uniqueness of u* is stated below as a lemma. 
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H = Ks, a ^ 1 



H = K3, a = 0.6 




trajectory of {p, u*) 
as Pi fixed 



symmetry 
broken in 
the shaded 
region 



Figure 7. Discontinuity in the symmetric solution u* as reflected in the (p, n)-phase diagram. 



Lemma 4.4. If < e{H)a <1 or if e{H)a > 1 and p > pQ{e{H)a) as defined in (4.7), then the 
optimization problem (4.3) is a maximized at a unique value of u. Otherwise, (4.3) is maximized 
at a unique u except for a single value of 132, where the maximum is attained at two distinct u's. 

We can also represent this jump of u* in the {p, ii)-phase diagram as follows. For each 7 > 0, 
consider the region C [0, 1]^ containing all points (p, u) such that {u"^, hp{u)) does not lie on the 
convex minorant of x 1— hp{x^/"'). When 7 < 1 the region is empty, but otherwise it is nonempty. 
The geometric argument in the previous paragraph shows that u can never appear as a maximizer 
to (4.5) if {p,u) G 'Be(_H')a! ^ut all other values of u can. Thus, for a fixed p = 1/(1 + e~^^), as 
P2 increases from —00 to 00, the point {p,u*) moves up continuously from in the (p, ti)-phase 
diagram, and jumps over i3e(//)Q as it reaches it. Thereafter it resumes moving up until hitting 1. 
This process is illustrated on the left of Fig. [7] when e{H)a = 3, e.g., when H = and a = 1. 



Turning to large deviations, we know from Lemma |3.4| that there is broken symmetry for the 
density of copies of a d-regular graph H whenever we are in Bd- It turns out that the same is 



true for the corresponding exponential random graph model given in (1.4). As we just saw though, 
one must remove Bf,{ H)a f rom the possible solution space for {p,u*). Whenever 7 < 7' we have 
B-y C By (see Lemma A.5), and consequently, if e{H)a > d then Be(H)a covers Bd and so the entire 
symmetry breaking phase is removed, leaving replica symmetry everywhere. This agrees with the 
results of Chatterjee and Diaconis [7| for the case a = 1. However, when e{H)a < d it is possible 
to have {p,u*) E Bd, in which case the construction from Lemma 3.4 breaks the symmetry. This is 
shown on the right of Fig. [Tjfor the case d = 2 and e{H)a = 1.8, e.g., for H = and a = 0.6. 

Proof of Theorem 14. 3[ The proof of Part ([a]) is essentially the same as the proof of Theorem 4.1 
in the work of Chatterjee and Diaconis [7j, except now we use the generalized Holder's inequality 



(Theorem 3.1) instead of the usual Holder's inequality. 

Suppose that a > d/e{H). We first need to show that in this case the only maximizers for the 
variational problem (|4.4|) are constant functions. Applying Corollary 3.2 for any / E Wo we have 

hpif) , 



/32tiH,fr - hp{f) < 132 WfWT' - hpif) < h 
where the last inequality used the assumption on a. This in turn is equal to 



e{H)a 
e{H)a 



{f32f{x,y)<''^"-hp{f{x,y)))dxdy< sup /Jsn^^^)" 

0<u<l 



hp{u) 
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showing that the j32t{H, f) — hp{f) is indeed maximized at constant functions. Furthermore, when 
f32U^^^^'^ — hp{u) is maximized at a unique u* , then equality holds in place of the above inequalities 
only for the constant function f = u* . An additional argument is needed to treat the case when 
j32U^^^'>°' — hp{u) is maximized at two distinct values. Either by checking the equality conditions 
in the proof of Theorem 3.1, or by referring to [18], we know that equality in Corollary 3.2 occurs 
if and only if /(x, y) = g[x)g{y) for some function g: [0, 1] — )■ [0, oo). It can then be easily checked 
that equality in the above sequence of inequalities can only occur when / is a constant function. 



The value of this constant u* is given by the optimization problem (4.5) and its the uniqueness is 
addressed in Lemma |4.4[ 

We now turn to prove Part Q. Since /3i > log(d — 1) — d/{d — 1), 

^ 1 . 1 ^ d-l ^ 

P 1 + e-/3i - 1 + e- iog{d-i)+d/id~i) d-l + e'^/('^-i) ' ' 



By Lemma A.l, x i— )• hp{x^/^) is convex for this value of p. Hence, hp{f) > hp{\\f\\^) by Jensen's 



3.2 



inequality with equality if and only if / is a constant function. Since t{H, f) < ||/||^ by Corollary 

MHjr - hpif) < P2 11/11?''^" - hpiyWd) < sup (/32«^(^)" - hpiu)) , 

0<ji<l 

with equality iff / is the constant function equal to u* , the unique maximizer of f32u'^^^^" — hp{u). 
The uniqueness of u* follows from Lemma [4.4| together with noting that when e{H)a > 1 we have 
P > Po{d) > po{e{H)a) as d > e{H)a and po{-) is increasing in [1, oo). 

It remains to prove Part (|c]). We have Q < p < pQ{d). Let 0<u<ti<lbe such that the 
lower common tangent to x i— >• hp{x^^^) touches the curve at x = u'^ and u'^. Since e{H)a < d, 
implies that the points {yf^^^°',hp{u)) and {u^^^^" , hp{u)) both lie on the convex 



A.5 



Lemma 

minorant of x i— )• hp{x^^^^^^^°'^)) and do not lie on the common lower tangent (if there is one). Let 
/32 and be as in the theorem statement, observing that these are the values of the derivative of 
hp{x^^^^^^^°'^) at X = n^(^)" and u^^^^", respectively. Then for any /3 G {H2il^2)i using the slope 
interpretation of /32 given in the discussion proceeding this proof, we see that the optimization 



problem (4.5) is maximized for some u* G {uiu). At the same time, by Lemma |3.4| there exists 
some / G Wo such that t{H, f) > {u*Y^^'i and hp{f) < hp{u*). It follows that 

sup {l32t{H, rr - hpif)) > /32(n*)^(^)- - hpiu*) = sup (/32n^(^)" - hpiu)) , 
/eWo o<u<i 

and hence f32tiH, /)" — hpif) is not maximized at any constant function. ■ 

5. Densities of linear hypergraphs in random hypergraphs 

In this section, we extend our results to densities of linear hypergraphs in random hypergraphs. 
Homomorphisms and densities are defined as in graphs. Similarly, for any r G [0, 1] let 

6niG,r) := sup J eaiAi, . . . , At) -r\Ai\ ■■■\Ak\ \ 

A^,...,A,CV{G)\ViG)f' 

where eciAi, . . . ,Ak) is the number of (ordered) hyperedges of the form (oi, . . . , a^) G x • • • x ^4^. 
The main result in this section is the following: 

Theorem 5.1. Fix d,k >2 and < p < r < 1. Let H be a d-regular k-uniform linear hypergraph. 

Let Gn ~ Q^^\n,p) be the random k-uniform hypergraph on n vertices with hyperedge probability p. 

(i) If the point (r*^, /ip(r)) lies on the convex minorant of the function x i— )■ hpix^^'^) then 



lim logP (tiH, Gn) > r<''A = -hpir) 
,^oo (J V y 
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and furthermore, for every e > there exists some constant C = C{H,e,p,r) > such that 
F r)<e\ t{H, G„) > r^(^)) > 1 - e"^"' . 

(ii) If the point {r'^, hp{r)) does not lie on the convex minorant of the function x i— )• hp{x^^'^) then 

lim T^logP (t{H,Gn) > r<''A > -hp{r) 

and furthermore, there exist e,C > such that 

P {inf {6n{Gn, s) : < s < 1} > e t{H, G„) > r^^^'^^ > 1 - e"^"* . 

In particular, when d = 2, case ^ occurs if and only if p < [l + {r^^ — l)i/(i~2r)j ^ 

Let A; > 2 be an integer and let W'-^^ be the space of all bounded measurable functions [0, l]'^ — t- M 
that are symmetric (i.e., f{xi,X2,---,Xk) = /(^^^^(i), x^(2)) • • • > any permutation vr of 
[k] = {1,2, ... ,k}). Let Wg'^^ denote all symmetric measurable functions [0,1]'^ — )• [0,1]. Every 

fc-uniform hypergraph G corresponds to a point / G Wq similar to the case for graphs. As 
before, we can endow W with usual L^-norm and in addition have the following cut norm: 



/ f{xi, ...,Xk) dxi--- dxk . 

Js^ x-xSt 



□ := sup 

Si,...,5fcC[0,l] JSix-xSk 

(k) 

This gives rise to the cut distance: for any f,g£ Wq , 

6n{f,g) ■.= mi\\f-g^\\^ 

a 

where a ranges over all measure-preserving bijections on [0,1], and G w'^^ is defined by 
g^{xi, . . . , Xk) = g{(j{xi), . . . , a{xk))- Let be the metric space formed by taking equivalences 
of points in Wq with zero cut-distance. 

(k) 

The space Wq is a straightforward generalization of the space Wo of graphons. Unfortunately, 
it does not fully capture the richness of the structure of hypergraphs. This notion is closely related 
to some initial attempts at generalizing Szemeredi's regularity lemma to hypergraphs (e.g., |10j). 
The main issue is that while the regularity lemma generalizes easily to this setting, there is no 
corresponding counting lemma for embedding a fixed hypergraph H unless H is linear (recall that 
a hypergraph is linear if every pair of vertices is contained in at most one hyperedge). The difficulty 
in extending the results to general H is related to the intricacies of hypergraph regularity (see, e.g., 
Gowers [2H] and Nagle, Rodl, Schacht, and Skokan jS^l EH]) as well as the recent progress in this 
direction by Elek and Szegedy |16t IT7j ) . Here we restrict ourselves to the basic setting above which 
suffices for controlling densities of linear hypergraphs. 

For any / E W^'^^ and any fc-uniform hypergraph H, write V{H) = [m] and define 

t{H,f)= TT f{xi^,...,Xi^^)dxi---dxm- 

The Chatterjee-Varadhan theory can be generalized to derive rate functions for large deviations 
of iJ-counts, where if is a fixed linear hypergraph. We outline the modifications and omit the 
complete details, as the changes required in the original proofs are mostly straightforward. 

We start with a statement generalizing the weak regularity lemma of Frieze and Kannan [19]. 
The analytic form of this statement for graphs can be found in Lovasz and Szegedy [34^ Lem. 3.1]. 
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Theorem 5.2. For every e > there exists some M{e) > such that for every f G Wq^^ there 
exist some m < M{e) and some g G Wg^^ with So{f,g) < e, and such that g is constant in each 
box (^,^1 X ••• X 



Using Theorem 5.2, the proof of Lovasz and Szegedy [34^ Thm. 5.1] can be modified to give the 
following topological interpretation of this result. 

Theorem 5.3. For any integer k >2, the metric space (Wg'^^fa) is compact. 



Theorems 5.2 and 5.3 allow us to generalize the framework of Chatterjee and Varadhan to 
(Wq^^^^d). The random hypergraph graph Q^^\n,p) corresponds to a random point J^'^H^.p) g 
yv'^'^\ and therefore it induces a probability distribution P^.p on W^^"^ supported on a finite set of 
points corresponding to hypergraphs on n vertices. 

Theorem 5.4. For each fixed p € (0, 1), the sequence P„.^p obeys a large deviation principle in the 

/ (k) \ (k) 

space (Wg ,fa) with rate function hp. Explicitly, for any closed set F C Wg , 

limsup logP„,p(F) < - inf /ip(/) , 

(k) 

and for any open set U C Wg , 

hminf ^ logP„,p([/) > - inf hp{f) . 

To derive large deviation results for subgraph densities in random graphs, it was crucial that 
the subgraph densities •) behaved continuously with respect to the cut topology. The next 
result implies that the same is true when H \s a linear hypergraph. The proof is a straightforward 
generalization of the proof for graphs (see [4, Thm. 3.7]). 

Theorem 5.5. Let H be a k-uniform linear hypergraph. Then for any f,gG Wg , 

\t{H,f)-t{H,g)\<e{H)6n{f,g). 

The rate function for large deviations in //-counts is then determined by the following variational 
problem. 

Theorem 5.6. Let H be a k-uniform linear hypergraph and let Gn ~ ^^''^(n,p) be the random 
k-uniform hypergraph on n vertices with hyperedge probability p. For any fixed p,r £ (0, 1), 

lim^-^logF (^tiH,Gn)>r<''^)=- mi [hpif): fG W^,''\tiH J) >r . (5.1) 

, . ~ — if.\ ~ 

Let F* be the set of minimizers for (5.1) and let F* be its image in Wg . Then F* is a non-empty 

compact set. Moreover, for each e > there exists some C{H,e,p,r) > so that for any n 

'fa(G„, F*) > e t{H, Gn) > r^^'A < e"^'^' . 



In particular, if F* = {/*} for some f* G Wg'^^ then the conditional distribution of Gn given the 
event t{H, Gn) > r^^^^ converges to the point mass at f* as n —)• oo. 



We now turn to study the variational problem (5.1) towards the proof of Theorem 5.1 The 
following inequality is an immediate consequence of Theorem |3.1[ 



(k) 

Lemma 5.7. Let H be a k-uniform hypergraph with maximum degree at most d, and let f £ Wg . 



ThentiH,f)<\\ffJ 



e(H) 



The following lemma mirrors Lemma 3.4 for proving the symmetry breaking phase. 



ON REPLICA SYMMETRY OF LARGE DEVIATIONS IN RANDOM GRAPHS 



23 



Lemma 5.8. Let H be linear d-regular k-uniform hypergraph. Let 0<p<r<lbe such that 
{r'^,hp{r)) does not lie on the convex minorant of x ^ hp{x^/'^). Then there exists f G Wq'^^ with 
tiHJ)>r<^^ andhp{f)<hp{r). 

The proof of Lemma |5.8| is essentially the same as the proof of Lemma |3.4[ with the following 
modification. One needs to adjust fe into /e = r + (ri — r)l^ + (r2 — where A C [0, l]'^ is 

the union of the box [0, a] x [a, 1 — 6]'^^^ along with the k — 1 other boxes formed by permuting the 
coordinates. Similarly B is the union of [1 — 6, 1] x [a, 1 — h]^~^ and its coordinate permutations. 



Proof of Theorem 15. 1[ Our starting point is an application of Theorem 



satisfies t{H^ f) > 



By Lemma 

,(fc) 



5.7 



5.6 



Suppose / G VU 



(fc) 



> r. For Part m\i of the theorem, Lemma 



3.3 



Part ([a]) (which is also valid for Wq ) implies that hp{f) > hp{r) with equality if and only if 



/ is the constant function r, so the variational problem on the right-hand side of (5.1) has the 
constant function r as the unique minimizer. For Part ^ of the theorem, Lemma 5.8 implies that 



the constant function r is not in the set of minimizers of the variational problem (5.1), and the 
conclusion follows analogously to the proof of Part of Theorem |1.1[ ■ 

6. Graph homomorphism inequalities 



Our goal in this section is to present a short new proof of Theorem 1.4, stating that for any 
d-regular bipartite graph G and any graph H allowing loops one has 

hom(G, H) < hom{Kd,d, H)^V{G)\/{2d) _ 

(This inequality is tight when G is a disjoint union of copies of the complete bipartite graph Kd^d ) 
Generalized to graphons, the inequality states that for any d-regular bipartite graph G and / G Wq 

i(G',/)<t(K,,.,/)l^(^)l/(2'^) (6.2) 

(the more general formulation follows from ( |6.1| ) via a standard limiting argument, e.g., see |21j). 
As mentioned in the introduction, all previously known proofs of this inequality involved entropy. 
In contrast, the following proof does not rely on entropy or limiting arguments and instead is an 



immediate consequence of the generalized Holder's inequality (Theorem 3.1). 



Proof of Theorem II. 4[ Label the vertices on the left-bipartition of G by [n] = {1, . . . , n}, and 

let Ai C [n] be the neighborhood of the i-th. vertex on the right-bipartition of G. Define 



g{xi, ...,Xd):=J fixi,y)f{x2,y) ■ ■■ fixd,y) dy for any xi,...,Xd£ [0,1] , 
and write g{xA) for x G [0, 1]" and A = {ii, . . . ,id} to denote g{xi-^, . . . ,Xi^). With this notation. 



t{G, f) = I WW fixi, Vj) dxi--- dxndyi ■ ■ ■ dyn = / 

1=1 iti •^[0.1]'^ 



9{xA^)---g{xAn) dx < 115112 



by the generalized Holder's inequality (Theorem 3.1). The result follows from noting that 

d d 

W W yj") ■ ■ ■ dxddyi ■■■dyd = t{Kd,d, f) ■ 
i=i j=i 



\9\\d 



Remark. A natural question to ask is whether Theorem |1.4| can be extended to all d-regular graphs 
G, as in the case for independent sets Unfortunately, the answer to this question is negative. 
A simple counter-example is G = and / being the graphon corresponding to the 2x2 identity 



matrix. The second author |47j extended Theorem 1.4 to non-bipartite G for certain families of 
/ G Wo, e.g., {0, 1}- valued graphons that are non-decreasing in both coordinates. Galvin [20] 
conjectured that if G is a simple d-regular graph and H is & graph allowing loops then hom(G, H) 
is at most the maximum of hom(i^rf,rf,i7)l^(^)l/(2'i) and hom(ifd+i, 
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7. Open problems 

It is natural to ask for extensions of the Chatterjee-Varadhan [8] large deviations theory to 
sparse Erdos-Renyi random graphs (i.e., 0{n,p) where p{n) — t- as n — >• oo) or to densities of 
general (not necessarily linear) hypergraphs in random hypergraphs. These may require extensions 
of Szemeredi's regularity lemma to sparse graphs (see [32l [22l [12]) and hypergraphs [231 ESI 139] . 
Even for Q (n, p) with fixed p, various problems remain open, several of which we highlight below. 

Minimizers of the variational problem. It was pointed out by Chatterjee and Varadhan [8] that 



no solutions of the variational problem in Theorem 2.7 are known anywhere in the symmetry 
breaking phase. This remains the case. In fact, there is not a single point {p, r) in the symmetry 
breaking phase where we can even compute the large deviation rate. It would be interesting to see 
whether the minimizers are always 2-step graphons, i.e., graphons which are constant on each of 
[0, wf, ([0, w] X {w, 1]) U {{w, 1] X [0, w]) and [w, 1]^ for some w £ (0, 1). 

Phase boundary for non-regular graphs. In this paper, we identified the replica symmetric phase 
for upper tail deviations in the densities of d-regular graphs. The phase boundary for non-regular 



graphs remains unknown. We suspect that a modification of the construction in Lemma 3.4 can be 
used to establish the symmetry breaking phase for some (perhaps all) subgraph density deviations. 
However, at present we do not have matching boundaries for the replica symmetric phase. 



Lower-tail phase transition. Proposition 3.5 shows that if a bipartite graph H satisfies Sidorenko's 
conjecture then there is replica symmetry everywhere in the lower tail deviation of i^-densities. 
It is an open question whether all bipartite graphs satisfy Sidorenko's conjecture, although it is 



possible that Proposition 3.5 can be proved without the full resolution of Sidorenko's conjecture. 

When H is not bipartite, the following argument shows that there exists symmetry breaking 
in the lower tail, at least for certain values of {p,r). Let / be the graphon taking the value on 
[0, U [|, 1]^ and the value p elsewhere, so that t{H, /) = and hp{f) = /ip(0)/2. Let tq G (0,j?) 
be such that hp{0) = 2/ip(ro). Then for any r G (0, rg) we have hp{f) < hp{r), and so (p, r) is in 
the symmetry breaking phase, resulting in a nontrivial phase diagram. We currently do not know 
the complete lower tail phase diagram for any non-bipartite H. 

Symmetry breaking in exponential random graphs. Fig. [3| sh owed several (/3i, /32)-phase plots for the 



symmetry breaking region given in Part Acv of Theorem 4.3 However, unlike the situation for large 



deviations, we do not know if that is the full region of symmetry breaking. It would be interesting 



to characterize the full set of triples {a, /32) for which there is replica symmetry in Theorem 1.3 



Appendix A. The convex minorant of hp{x^/'^) 

This appendix contains some technical lemmas about the convex minorant of hp{x^^"'), which 
appears throughout the paper. Let us first informally summarize the claims. It is well known that 

hp{x) = X log - + (1 - x) log ^ — - 
p 1 — p 

is a convex function of x. When 7 > 1 (here we allow any real 7, not just integers), it turns out 
that there is some Poij) for which x 1— t- /ip(xi/'>) is still a convex function when p > po- However, 
when p < pq, the function x 1— )• hp(x^^"') is no longer convex: it has exactly two infiection points, 
both to the right of its minimum at x = p^ . The function is concave in the corresponding middle 
region, whereas it is convex in the two outer regions. 

In the case when x 1— )■ hp{x^^^) is not convex, it has a unique lower tangent, touching the plot 
of the function at the points {q^,hp{q)) and {(p,hp{q)). The convex minorant of x 1— )• hp{x^/'^) is 
formed by replacing the middle segment x G {(p ,Q^) by the lower common tangent, as shown in 
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Figure 8. Illustration of the convex minorant oi x ^ hp{x^^). 



Fig. [sj (The various not-to-scale plots of hp{x^/'^) are shown for illustrative purposes in order to 
highlight the features of the plots. In contrast, all the phase diagrams plots are drawn to scale.) 

Let B.y C [0, 1]^ denote the set of all points (p, q) such that {q^ , h{q)) does not lie on the convex 
minorant of x i— )• hp{x^/'^). Each vertical section of is thus the interval {q"',q^) described in the 
previous paragraph. For example, B2 is the shaded region in Fig. [9j and the boundaries of Bj for 
additional values of 7 were plotted in Figure [2] (see Section [T]) . 

We shall show that each Bj resembles a rotated V-shape. More importantly, we show that B-y 
strictly contains By if 7 > 7'. 

The first lemma describes the shape of the function x 1— )• hp{x^/^). 



Lemma A.l. The function x 1— )• hp{x with domain (0, 1) is convex for < 7 < 1. If j > 1, 

and 

■= 7 - 1 +"eW(7-i) ' 

then the function is also convex. // 7 > 1 and < p < po, then the function has exactly two 
inflection points (both to the right of x = p^ ), with a region of concavity in the middle. Finally the 
function has infinite derivatives at both endpoints o/(0, 1). 

Proof. When < 7 < 1, x 1— )• x"^^^ is convex. Since the composition of two convex functions is 
convex we deduce that x 1— t- hp{x^^"') is convex. 
Next, assume d> 1. We have 



h'{x) = log 



log 



p 



and 



h''{x) 



1 



Therefore, 



and 



dx"^ 



1 — x p' ^ x(l — x) 

A/, (^1/7) = ixV7-l/,' (^1/7) 

dx 7 ^ 

/, (^1/7) = 1 ^2/7-2;^//(^l/7) + i ("i _ l^ ^l/7-2w (^1/^) 

7 7 VT / 



4/7-2 



x'/^h;{x'/^)-{j-i)h'p{x'/^)) . 
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0.2 0.4 0.6 0.8 1.0 



Figure 9. The region B2 consisting of all points {p,q) such that {q^,hp{q)) does 
not lie on the convex minorant of x i— )• hp{^/x). 



The claim on infinite derivatives easily follows from the above formulas. Setting x = q"' , we have 



x=q1 



{qh';{q) - - l)h'^{q))) 

,1-27 / I q 

- (7 - l)log:; h (7 - l)log 



1 



1 



1 — p 



(A.l) 



Hence, hp{x^/'~^) is convex x = q'^ whenever — (7 — 1) log > —(7 — 1) log 3^ and concave 
otherwise. The fact that 



d 
dq 



(7 - 1) log 



1 



79-7+1 



implies that — (7 — 1) log is decreasing until = (7 — l)/7 and then increasing afterwards. 
It diverges to +00 at both endpoints of (0, 1) and attains a minimum value of 7 — (7 — 1) log(7 — 1) 



at g = (7 — l)/7. The term (7 — 1) log 3^ of Eq. (A.l) is increasing for p € (0, 1) and surjective 
onto the reals. Therefore, hp{x^/'^) > for all x if 7 — (7 — 1) log(7 — 1) + (7 — 1) log > 0, 



which is equivalent to having p > 



7-1 



Additionally, if p < 



7-1 



7-l+eT/(i'-l) 



jy, then h''{x) starts 



7-l+e''/(T-l) 

as positive, becomes negative, then turns positive again. 

We next give an explicit description of the region 82- 

Lemma A. 2 (7 = 2 case). Let p,q € (0,1). The point {(f',hp{q)) lies strictly above the convex 
minorant of x ^ hp{^/x) if and only if p < (l + {q~^ — l)^/(i~2g)'j i_ 

Proof. We claim that the lower common tangent of x 1— )• hp{^^) has slope \og(^-^). To show this, 
it suffices to check that hp{^^) — x\og{^—^) has a horizontal common lower tangent, and it suffices 



to check the same thing for hp{x) — x^ log(^— 2). Observe that 



hpix) — x^ log = X log — h (1 — x) log - 

p p 1 



X log 



l-p 



p p 
X log X + (1 — x) log(l — x) — x(l — x) log 



l-p 



log(l - p) 
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Figure 10. Illustration of the convex minorant of rc i— t- hp{x^/'^) in the setting of Lemma A. 3 

X log 





1 

ql 



1 

qi 



7 

qi 



7 

qi 



Figure 11. Illustration of the functions in Eq. (A.3). 



is invariant under x i— >• 1 — x, so that its lower tangent must be horizontal by symmetry, and let it 
touch the curve at x = (7, so that Q < q <q < 1 are the zeros of the derivative, namely 



log 



1 



(1 - 2x) log 



P 
1 — p 



(A.2) 



It follows that (g^, hp{q)) lies strictly above the convex minorant if and only \i q < q < q, which is 
equivalent to having jzjiq 1^ ^ log . Rearranging the latter concludes the proof. ■ 

For 7 > 1 and < p < ^0(7); define q = q{'^,p) and q = q{j,p) to be such that the lower common 
tangent to x 1— )• hp{x^^"') touches the curve at points {q'^ , hp{q) and {q'^ , hp{q)). An examination of 
the geometry of the curve, as illustrated in Fig. flO) immediately leads to the following lemma: 



Lemma A.3. Let 7 > 1 and < p < Po{'j)- Let < gi < 52 < 1- U the line segment joining points 
{qj,hp{qi)) and (^2'^p('?2)) H^s below the curve {{q^,hp{q)) : < g < 1} and is not tangent to the 
curve at one of the end points, then this segment lies strictly above the lower common tangent of 
the curve. Consequently, q{'^,p) < qi < q2 < 9(7, p)- 

We now apply the above lemma to describe the shape of the regions B-f. 

Lemma A. 4. If j > 1 and < p < p' < po{'y), then q{-j,p) < q{'^,p') < q{l,p') < q{'y,p)- So 
is a rotated- V-shaped region. 



Proof. Let qi = q{'~i,p') and q2 = q{j,p')- As illustrated in Fig. 11, we have 

1 



hp{x 



l/7^ 



hpi{x 



l/7^+^l/7 1og^llL^+log 
p(l - pi) 



P 



1 — p 



(A.3) 
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Figure 12. The plot of x i-?- hp 



471 



following a change of variable x 



n'h 



The segment joining {qi,hp'{qi)) and (?2'^p'(92)) lies below x i— )• hp'{x^/"') by definition. Since 

X I— )• x^^^ log p^iZ^"^^ is concave, the segment joining x = qj and x = ql must also lie below the 

curve X i— >• hp{x^^'^), and it is not tangent to either endpoint due to the x^^"' term. The conclusion 
now follows from Lemma |A.3[ ■ 

Lemma A. 5. // 7 > 7' > 1 and < p < py, then q^jjp) < q{"l' -.p) < q{'~f',p) < q{l-,p)- So B-y 
strictly contains By. 

Proof. Let qi = (7(7', p) and q2 = qij' ,p)- Let I' denote the line segment joining points {qj , hp{qi)) 
and {ql ,hp{q2)), so that £' is tangent to the curve x 1— )• hp{x^/^'). 

Consider a transformation of the plots induced by the change of variable x 



u"' /7_ xhe plot of 

X I—)- hp{x^^^ ) becomes the plot of n 1— t- hp{u^^'^) (see Fig. 12). Originally i' was a line segment of 
positive slope lying below the curve x 1— )• and tangent to it at both endpoints. Following 

the transformation x = u"' (recall that 7'/7 < 1) we see that £' becomes a concave curve i still 



lying below the new curve u 1— t- hp{u and tangent to it at both endpoints. This implies that the 
line segment joining the two endpoints of i in the new frame lies below both curves and is tangent 



to neither at the endpoints. The conclusion then follows from Lemma A. 3 
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